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Abstract 

In the last decades, the study of models for large real- world networks has been a very popular 
and active area of research. A reasonable model should not only replicate all the structural prop- 
erties that are observed in real world networks (for example, heavy tailed degree distributions, 
high clustering and small diameter), but it should also be amenable to mathematical analysis. 
There are plenty of models that succeed in the first task but are hard to analyze rigorously. On 
the other hand, a multitude of proposed models, like classical random graphs, can be studied 
mathematically, but fail in creating certain aspects that are observed in real-world networks. 

Recently, Papadopoulos, Krioukov, Boguna and Vahdat [INFOCOMT0] introduced a ran- 
dom geometric graph model that is based on hyperbolic geometry. The authors argued em- 
pirically and by some preliminary mathematical analysis that the resulting graphs have many 
of the desired properties. Moreover, by computing explicitly a maximum likelihood fit of the 
Internet graph, they demonstrated impressively that this model is adequate for reproducing the 
structure of real graphs with high accuracy. 

In this work we initiate the rigorous study of random hyperbolic graphs. We compute exact 
asymptotic expressions for the expected number of vertices of degree k for all k up to the 
maximum degree and provide small probabilities for large deviations. We also prove a constant 
lower bound for the clustering coefficient. In particular, our findings confirm rigorously that 
the degree sequence follows a power-law distribution with controllable exponent and that the 
clustering is nonvanishing. 



1 Introduction 



Modeling the topology of large networks is a fundamental problem that has attracted considerable 
attention in the last decades. Networks provide an abstract way of describing relationships and 
interactions between elements of complex and heterogeneous systems. Examples include technolog- 
ical networks like the World Wide Web or the Internet, biological networks like the human brain, 
and social networks which describe various kinds of interactions between individuals. 

An accurate mathematical model can have enormous impact on several research areas. From 
the viewpoint of computer science, an obvious benefit is that it could enable us to design more 
efficient algorithms that exploit the underlying structures. Moreover, the process of modeling may 
suggest and reveal novel types of qualitative network features, which become patterns to look for 
in datasets. Finally, an appropriate model allows us to generate artificial instances, which resemble 
realistic instances to a high degree, for simulation purposes. Unfortunately, from today's point of 
view, a significant proportion of the current literature is devoted only to experimental studies of 
properties of real-world networks, and there has been only little rigorous mathematical work. 

There are (at least) two requirements for a reasonable model for real-world networks. First, it 
must be able, when setting the parameters appropriately, to replicate the salient features of the real- 
world graphs under consideration. Moreover, a second desired property is that the model should be 
mathematically tractable and simple enough to be of use in large scale simulations. There are plenty 
of models that satisfy the first criterion, but are hard to analyze from a mathematical viewpoint. 
On the other hand, there exists a plethora of analytically tractable models, which unfortunately 
do not yet replicate satisfactory enough the properties that are observed in large networks. 

In this work we initiate the rigorous study of a class of models for large networks, the so- 
called random hyperbolic graphs. Such graphs were shown empirically to have startling similarities 
with several real- world networks, and in particular with the Internet graph (i.e., the network formed 
by the routers and their physical connections). Before we describe the model and our results, let 
us proceed with considering some properties of large networks in a little more detail. 

Properties of large networks. Since the 60's, the study of networks of various kinds has grown 
into a significant research area. One of the initiators in this field, the sociologist Stanley Milgram, 
investigated the network that is obtained from the relationships among people [19tl28|. In his work 
he discovered what is nowadays known as the small-world phenomenon, which postulated that the 
distance between two random people is on average between five and six. Outside the context of 
social networks this has become synonymous to a graph with a comparatively low diameter /average 
path length, and nowadays many networks are known to possess this property [2| [3l [29]. 

Another property that is found in many networks addresses the degree distribution. In a 
celebrated paper, Faloutsos et. al. [H] observed that the Internet exhibits a so-called scale-free 
nature: the degree sequence follows approximately a power-law distribution, which means that the 
number of vertices of degree k is proportional to some inverse power of k, for all sufficiently large 
k. This sets such a network dramatically apart from e.g. a typical Erdos-Renyi random graph and 
stirred significant interest in exploring the causes of this phenomenon. From today's viewpoint, it 
is well-known that many graphs have a heavy-tailed degree distribution, which may be close to a 
power-law or a log-normal or a combination of these distributions (see [20] and references therein). 

A third distinctive feature of large real- world graphs is the appearance clustering \21 \ \26 \ [29] . 
The network average of the probability that two neighbors of a random vertex are also directly 
connected is called the clustering coefficient. Measured clustering coefficients for social networks 
are typically tens of percent, and similar values have been measured for many other networks as 
well, including technological and biological ones. 
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Models of large networks. Perhaps the first step towards a random graph model for real-world 
networks was made by Watts and Strogatz |29j in 1998, who addressed the small-world phenomenon 
and clustering and gave reasons for its emergence. However, the degree distribution of the generated 
graphs follows a Poisson distribution, and thus is not heavy tailed. Barabasi and Albert proposed [5] 
that the cause for power-law degree distributions is preferential attachment: the networks evolve 
continuously by the addition of new vertices, and each new vertex chooses its neighbors with a 
probability that is proportional to their current degree. This model was shown by Bollobas et 
al. [7] to produce power-law degree distributions, but on the other hand it generates graphs that 
typically have a vanishing clustering coefficient [8]. Nevertheless, the Barabasi- Albert model was 
the beginning of a vast series of proposed models that suggested mechanisms according to which a 
network can evolve (see e.g. [H El QUI EH E21 E3 EE] for a non-exhaustive but representative list). 

Hyperbolic random graphs. An alternative and fruitful approach towards understanding the 
structure and the dynamics of real-world networks is to attempt to describe the similarities or 
dissimilarities between vertices in a well-defined and formal sense. One possibility in this direction 
is based on the idea of assigning virtual coordinates to the vertices, i.e., the network is embedded 
in some metric space such that the mutual distances abstract the resemblance among the vertices. 

One natural choice for the underlying metric space is the Euclidean space. In this context, 
Ng et al. [22] proposed to embed the Internet graph into such spaces. Their original aim was 
to predict distances in the network by simply comparing coordinates. The authors obtained a 
reasonable mapping in 5 and 7 dimensions, but not without distortion and errors. Shavitt and 
Tankel [27] later observed that this embedding becomes dramatically better when replacing the 
Euclidean geometry with a negatively curved hyperbolic space. 

The above considerations lead us immediately to the model of Random Geometric Graphs. Such 
a graph is generated by placing independently and uniformly at random n vertices in, say, [0, l] 2 , 
and creating edges whenever the (Euclidean) distance of two vertices is at most some r = r(n). 
These graphs have been studied intensively by many authors because of connections to percolation, 
statistical physics, hypothesis testing, and cluster analysis [25 1. Unfortunately, these results provide 
strong evidence that Euclidean geometry is not the adequate choice if one wants to describe large 
real- world networks, as the qualitative characteristics of the resulting random networks (like the 
average path length or the degree sequence) are very far from the ones observed in practice. In 
other words, the underlying geometry capturing the main structural characteristics of real- world 
networks is not Euclidean, and the important question is whether there exists an appropriate choice 
of a geometry giving rise to the observed features. 

A preliminary answer to this question was given by Papadopoulos, Krioukov, Boguha and 
Vahdat |24j . The authors demonstrated impressively that complex scale- free network topologies 
with high clustering coefficients emerge naturally from hyperbolic metric spaces. Their model, 
which we will denote by random hyperbolic graph, consists in its simplest variant of the uniform 
distribution of n vertices within a disk of radius R = R(n) in the hyperbolic plane, where two 
vertices are connected if their hyperbolic distance is at most R. The authors show via simulations 
and some preliminary theoretical analysis that the generated graphs exhibit a power-law degree 
distribution, whose exponent can be tweaked via model parameters. Further, the authors indicate 
that with a slightly more complex model they can also control the clustering of the generated graphs 
to bring it in line with real-world networks. 

To make their case, Boguha, Papadopoulos and Krioukov computed in [6] an embedding of 
the Internet graph into the hyperbolic plane by finding the maximum likelihood match to the 
model that was described above, and demonstrated impressively that this embedding has many 
desirable properties. For example, the authors examined the performance of greedy routing using 
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the hyperbolic coordinates, i.e. the scheme in which each node forwards an incoming message to a 
neighbor that is closest to the destination, see also the works of Kleinberg [15] and Papadimitriou 
et al. [23j. In their embedding, this simple greedy forwarding strategy exhibits a remarkably strong 
performance and connects 97% of all vertex pairs. The average stretch factor between chosen and 
optimal path is around 1.1, suggesting that greedy paths are very close to optimal. They also 
showed that this performance remains strong even if a fraction of the nodes is allowed to fail. 

Our contribution. Regarding the experiments just described, it seems at least fair to say that 
random hyperbolic graphs provide an attractive model that has a high potential of being adequate 
for describing the characteristics of many real-world networks. Moreover, a simple formulation and 
a strong affinity to random geometric graphs indicate that this model might be mathematically 
tractable. In this work we show that this is indeed the case and initiate thereby the rigorous 
study of hyperbolic random graphs. First, we prove a constant lower bound on the clustering 
coefficient of hyperbolic random graphs which confirms the claimed high clustering. We then show 
that the expected degree distribution indeed follows a power-law across all scales, i.e., even up to 
the maximum degree. Note that in the seminal papers [24J and [16j the degree distribution was 
also considered, however only for constant degrees and without any error guarantees. In addition, 
we prove small probabilities for large deviations, i.e. we show that sampling from this distribution 
returns with high probability a graph with the desired properties, which is crucial for validating 
experimental results. We also compute tight bounds for the average and maximum degree that 
hold with high probability. 

There are many models for which either a power-law degree sequence [U [7J [10] or a large 
clustering coefficient [29J has been proven. But this is the first model which provably satisfies both 
properties. Note also that while there are some models (see for example [7J and [TD]) for which a 
power-law degree distribution up to polynomially large degrees can be showed, to the best of our 
knowledge this is the first rigorous proof that the degree distribution of a random graph model is 
scale-free up to the maximum degree. Further, our results reveal some fundamental combinatorial 
properties of the model, thus setting the groundwork for further theoretical investigations. We 
strongly believe that these facts together with the nice combinatorial structure of the model make 
it attractive for the theoretical computer science and random graph community. 



Let us begin this section with a few facts about the geometry of hyperbolic planes. We will restrict 
ourselves to the most basic notions, and refer the reader to e.g. [I] and many references therein for 
an extensive introduction. 

First of all, there are many equivalent representations of the hyperbolic plane, each one high- 
lighting different aspects of the underlying geometry. We will consider here the so-called native 
representation, which was described by Papadopoulos et. al. in [24J , as it is most convenient for 
defining the model of random hyperbolic graphs. 

One basic feature of the hyperbolic plane is that it is isotropic, meaning that the geometry is 
the same regardless of direction. In other words, we can distinguish an arbitrary point, which we 
call the center or the origin. In the native representation of the hyperbolic plane we will use polar 
coordinates (r, 6) to specify the position of any vertex v , where the radial coordinate r equals the 
hyperbolic distance of v from the origin. Given this notation, the distance d of two vertices with 
coordinates (r, 9) and (r', 9') can be computed by solving the equation 



2 Model & Results 



cosh((f) = cosh(r) cosh(r') — sinh(r) sinh(r') cos(# — 9') 
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where cosh(x) = (e x + e~ x )/2 and sinh(:r) = (e x — e~ x )/2. For our purposes we will denote from 
now on by d(r, r',9 — 9') the solution of (12. ip for d. 

The crucial difference between the Euclidean and the hyperbolic plane is that the latter contains 
in a well-defined sense more "space" . More specifically, a circle with radius r has in the Euclidean 
plane a length of 2irr, while its length in the hyperbolic plane is 2tt sinh(r) = Q(e r ). In other words, 
a circle in the hyperbolic plane has a length that is exponential in its radius as opposed to linear. 

Based on the above facts, the authors of [24] defined a model of random geometric graphs that 
in its simplest version consists of the uniform distribution of n points into a hyperbolic disk of 
radius R = R(n) around the origin. Two points in this disk are connected by an edge only if they 
are at hyperbolic distance at most R from each other, as defined in (I2.ip . More precisely, note that 
the total area of a circle of radius r equals 



To choose the n points uniformly at random in the hyperbolic disk of radius R it suffices to choose 
for each polar coordinates (r, 9) such that 9 is chosen uniformly at random in the interval, say, 
(— 7r, 7r] and its radial coordinate r is drawn according to the distribution with density function 
sinh(r)/(cosh(i?) — 1), where < r < R. To add flexibility to the model, the authors of [24] use 
a slightly different density function for the radial coordinate: asinh(ar)/(cosh(aii!) — 1), where 
a > 1/2. For a < 1 this favors points closer to the center, while for a > 1 points with radius closer 
to R are favored. For a = 1 this corresponds to the uniform distribution. 

Let us now proceed to a formal definition of the model. With all the above notation at hand, 
the random hyperbolic graph G aj c(n) with n vertices and parameters a and C is defined as follows. 

Definition 1 (Random Hyperbolic Graph G Qi c*(n)). Let a > 1/2, C 6 I, n £ N, and set R = 
21ogn + C . The random hyperbolic graph G Qj c(n) has the following properties. 

• The vertex set V of G a< c(n) is V = {1, . . . , n}. 

• Every v G V is equipped with random polar coordinates (r v ,9 v ), where r v £ [0,R] has density 
p(r) := Q CO s'h(aii;)-i an ^ ® v * s drawn uniformly from [— ir, tt]. 

• The edge set of G aj c(n) is given by {{u, v} C ( 2 ) : d(r u ,r v ,9 u — 9 V ) < R}. 

The restrictions in the model parameters, especially the condition a > 1/2 and the definition 
of R will become clear in the sequel. Informally speaking, the choice of R guarantees that the 
resulting graph has a bounded average degree (depending on a and C only). If a < 1/2, then the 
degree sequence is so heavy tailed that this is impossible. 

Let us mention at this point that in [24] an even more general model was also proposed. There, 
each pair of vertices is connected with a probability that may depend on the hyperbolic distance of 
those vertices. In particular, this probability is large if the vertices have distance < R, and becomes 
quickly smaller when the distance is larger than R. We will not treat this model here. 

Let us next describe the results that we show for G ai c(n). First of all, we study the clustering 
of hyperbolic random graphs. The local clustering coefficient of a vertex v is defined by 





(2.2) 
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where T(v) := {u \ {u,v} G E}. The global clustering coefficient of a graph G = (V,E) is the 
average over all local clustering coefficients 



n 

Our first theorem gives a constant lower bound on the global clustering coefficient which holds with 
high probability. 

Theorem 2.1. Let a > 1/2, C G M and c = c(G Qj c(n)). T/ien E[c] = 6(1) and wit/i 
probability c = (1 + o(l))E[c]. 

Then we study the degree sequence, and provide sharp bounds for the number of vertices of 
degree k. 

Theorem 2.2. Let a > 1/2 and C G R. .Sei 5 = min j ^a+ija » 5(2a+ij } • Then, with high 

probability, for all < k < n &> , where 5' < 5, the fraction of vertices of degree exactly k in G a) c(n) 
%s 

* + *» ^ i^BwT - -) - r • <-> 

where T(x) = J °° t x ~ 1 e~ t dt denotes the Gamma function and £ = ^^-1/2) e ~ C ^ 2 ■ If n& <k< " og ^ 
i/ien wni/i /ug/i probability the fraction of vertices of degree at least k in G a) ci n ) is 

Note that this result demonstrates that the degree sequence of G a) c{n) is a power-law with 
exponent 2a + 1 > 2. To see this, note that for sufficiently large k we have that T(k — 2a) /kl = 
G(/c _2a_1 ), i.e., (|2.4p and (12.51) imply that the number of vertices of degree k in G aj c{n) is (1 + 
o(l))c Q , j c , fc _2a_1 'T' ) for an appropriate c a ,c > 0. 

Our next result gives bounds for the average degree of G a> c(n). 

Theorem 2.3. Let a > 1/2 and C G M.. Then the average degree of G a ,c(n) is (1 + o(l)) jjfejjT^p ■ 

Note that Theorem 12.21 and Theorem 12.31 confirm the results in |24j . Finally, we give sharp 
bounds for the maximum degree in G at c(n). 

Theorem 2.4. Let a > 1/2 and C G K. T/ien £/ie maximum vertex degree of G a fi{n) is with high 
probability n^^' 1 '. 

In this work we focus on the degree distribution and the clustering of hyperbolic random graphs. 
We believe that the the diameter of the giant component and the performance of greedy routing 
using the hyperbolic coordinates are interesting questions for future work in this area. 



3 Properties of the Model 



Recall that according to Definition [T] the mass of a point p = (r,9) is f(r) = 2 7r(cosh( 1 i < R)-i) ' anc ^ 
does only depend on the radial coordinate of p. Accordingly, we define the probability measure 
fj-(S) of a point set S as 

MS) = / f(v)dy. (3.1) 
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A vertex located at (9, r) is connected to all vertices with coordinates (9',r') such that d(r,r',9 — 
9') < R. Let us define the ball of radius x around a point (r, 9) as 

B rjB (x) = { (/, 9') I d(r, r',9-9')<x}. (3.2) 

Since in the definition of our model two points are connected if and only if they are at distance 
at most R, we will typically consider the intersection B rit g 1 (R) n f?o,o(-R) which corresponds to 
the point set in which all vertices are connected to a fixed vertex at (r\ ,9\). By (13, ip we can 
determine the probability measure of such a set by integrating f(y) over all points in the set. In 
our specific case we achieve this by integrating first over all y £ [0, R] and then over all 9 such that 
d(r\,y,9i — 9) < R. As f(y) does not depend on 9 we are only interested in the range of 9 for 
which this inequality is satisfied. One extremal of {9\ — 9) for which it is satisfied is clearly 

/cosh(ri) cosh(y) — coshfi?) \ 

9 n {y) = arg max {d[ri,y,4>) < R\ = arccos — - • (3.3) 

0<<^<tt y smh(ri) smn(y) / 

Because of symmetry of the cosine the other extremal is —9 ri (y) and we therefore have to integrate 
from —9 n (y) to 9 n (y) as all those angles 9 satisfy d(r\,y,9) < R. Therefore we arrive at the 
following expression: 



fi(B r>g (R)nB 0fl {R))= / f(y)d9dy = 2 / f(y)d9dy. (3.4) 

J J-8 r (v) JO JO 



rR rdr(y) rR re r {y) 

f(y)d9dy = 2 \ 

-B r (y) Jo Jo 

Note that fj,(B r fl(x)r\Bofi(R)) does not depend on 9 and therefore we shorten it to n(B r (x)PiBo(R)). 
Before we commence with the more technical part of this section, we quickly refresh the following 
basic estimates of cosh(x) and sinh(x). For all x > 

e z e z (x> 1/2 In 3) g£ 

— < cosh(x) < e x and — < sinh(x) < — . (3-5) 

We first prove a technical lemma that gives almost tight bounds on 9 r (y). 
Lemma 3.1. Let < r < R and y > R — r. Then 

9 r {y) = 2e^ (l + Q(e R -r-y)). 
Proof. By using (|3.3p and the trigonometric identity 

cosh(x ± y) = cosh(x) cosh(y) ± sinh(x) sinh(y). (3.6) 

we infer that 

sinh(r) sinh(y) + cosh(r — y] — cosh(ii) 



cos 



1 + 2 
1 + 2 



sinh(r) sinh(y) 

J-y + e -r+y e R + e -R 



{e r - e~ r )(ey - e~v) (e r - e" r )(ef - e~f) 
e~ 2r + e- 2y e R-r-y + e -R-r-y 



1 - e~ 2r )(l - e- 2 v) (1 - e- 2r ){l - e~ 2 v) ' 



Observe that both r and y are non-negative. By applying the identity 1/(1 — x) = 1 + Q(x), which 
is valid for all < x < 1, we obtain 



cos 



i(9 r {y)) = 1 + 2 (e~ 2y + e~ 2r - e R ~ r ~ y - e- R ~ r ~ y ) (l + 0(e~ 2r )) (l + Q{e~ 2y )) 



y ~= r 1 - 2e R ~ r - y + e(e~ 2y + e- 2r ) 
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In our next estimates we will get rid of the cosine in the above expression. By cos(#) > 1 — 4- we 
derive 

r {y) 2 > Ae R - r ~ y - G (e" 2r + e- 2y ) . 

Note that whenever \x\ < 1 

^TT^= 1 + | + G(x 2 ). (3.7) 
Applied to the previous equation, this gives a lower bound of 

R-r-v ( ^fe~ 2r + e- 2 y 



g r (y) > 2e~i- ( 1 - e ( ^ r _„ ))■ (3i 



git— i — y 



Next we will derive an almost matching upper bound for 9 r {y). First we exploit that cos(#) < 



1 — ^ + ^ and thereby 



2 4! 

This quadratic equation can be solved exactly by using basic tools. We omit the detailed calcula- 
tions, and show just the final outcome. We obtain that 



M,) 2 <o-^i-^--(i-e(£^±^)). 

Note that if |e- R ~ r_2/ (l - G ( e ~^u-r~y V )) < 1 then we can apply ([321) • If on the other hand we 
have y R - r ~y (l - G ( +% 2 " ) ) > 1 then G ((e^-^ - (e" 2r + e" 2 ^)) 2 ) = 9(1). With this we 



- 2 '-+e- 2 " 

g-K-r — y 

obtain 



^.(y) 2 < 4e*— (l - G ( e "^0 ) + ((^^ " ( e " 2r + e " 2 ')) 2 ; 

= 4e R " r -^ - G (e- 2r + e" 2 *) + G (e 2 ^"^ «)) - Q (e fl -^( e - 2r + e" 2 *)) + G (e- 4r + e" 4 ^) 
= 4e ii -'- 2/ (l + G(e jR - r - 2 ')) . 

Hence by applying again (13.71) we get 

0r(y)<2e^ (i + e(e R - r -y)) 

which together with (|3.8|) concludes the proof. □ 

Our second lemma gives precise estimates for the measures of several useful combinations of 
balls. It is heavily used in all the later calculations, and is an important ingredient of our proofs. 
Note that the claimed formulas look a bit overloaded on the first sight; however, the derived bounds 
make it applicable for different purposes as we will see in later parts of this paper. 

Lemma 3.2. For any < r < R and any < x < R we have 

fi{B (x))=e-^ R - x \l + o(l)) (3.9) 
n{B r {R) n B (R)) = J 2 ^'^ (l ± Oie^ 1 ^ + e~ r )) . (3.10) 
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Figure 1: The grey areas represent the the point sets considered in (|3.9|) . (|3.10|) . (|3.11|) and (|3.12p . Note 
that the native representation of the hyperbolic space is used, implying that the mass at larger distances 
from the origin grows exponentially fast. 



Further, for x < R — r 



n{{B r {R) n B (R)) \ B (x)) = J£ e _y 2) (l ± 0(e-( Q -V2)r + e -)) , (3.11) 
while for x > R — r it holds that 

fi{(B r (R)nB (R))\B (x)) = 

2ae- r ' 2 ( ^ f i + a-l/2 c _ 2 \ e _ {a _ 1/2){R _ x) \ / ± Q ^ r + e _ r _ {R _ x){a _ 3/2) 



7r(a- 1/2) V V a + 1/2 



(3.12) 



Before we continue with the proof of the lemma, let us give an intuitive description of the 
statement. Let us in particular consider (|3.1(J|) . as the subsequent equations are refinements of it. 
Equation (|3.10p states that the mass of the intersection of B r (R) and Bq(R) is, up to constants 
and error terms, equal to e~ r l 2 . Recall that in G a% c{n) every point in B r (R) n B (R) is connected 
to the point p with radial coordinate r and 9 = 0. Thus, the degree of p is a binomial distribution 
with parameters n and e~ T / 2 . In particular, if r is small, then the expected degree of p is large, 
and on the other hand, if r = 21ogn ~ R, then the expected degree of p is 0(1). In other words, 
the closer a vertex is located to the border of the disc, the smaller its degree will be, and (I3.10P 
allows us to quantify precisely the dependence. 

Proof of Lemma \3.S\ The definitions of fi and B, see (I3.ip and (I3.2p . imply that 

/r, / ^ o f x r, \j f x asinh(ay) cosh(evx) - 1 -MR-x) 

Im(B (x)) = 2vr / f{y)dy = / ' dy = -j— = (1 + o 1 e al - H x > . 

Jo Jo cosh{aR) — 1 cosh(ait) — 1 

This proves (|3.9|) . We compute fx(B r (R) n Bo(R)) as discussed in Equation (|3.4p . It follows that 

[■R [My) 

Li{B r (R) n B (R)) =2 / f{y)dddy 

Jo Jo 



where 6 r (y) is as defined in f|3.3|) . Note that for y < R — r we have 6 r (y) = tt due to the triangle 
inequality. We can therefore split the integral into two parts, and we obtain 

Li{B r (R) n B (R)) = v(Ba(R ~r))+2 9 r (y)f(y)dy. (3.13) 

J R-r 
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The first part can be computed directly with (|3.9p . With similar arguments we establish that for 
< x < R- r 

r R 

fi{(B r (R) n B (R)) \ Bq(x)) = fi{B (R - r)) - »{B (x)) + 2 / 9 r (y)f(y)dy (3.14) 

JR-r 

and for R — r < x < R 

r R 

Li{(B r (R)nB (R))\B (x)) = 2 / 9 r (y)f(y)dy. (3.15) 

J X 

In the sequel we use Lemma [3TT1 to compute the integrals in (|3.13p . (|3.14p and (|3. 15[) . We begin 
with 

2 />>, W! , . 2 (, ± (3,6, 

where we assume that the lower bound x is such that x > R — r. We first solve the integral for the 
leading term without the error term. A simple calculation shows that 

/e - ^ 2 sinh(ay)<i?/ = - — 7. -e~ y ^ 2 (2a cosh(ay) + sinh(ay)). 
4a z — 1 

Thus, 

2e~^~ — 7 — ; , " — ;dy = 
27r(cosh(ai?) - 1) 

4ae _r/ ' 2 (2a cosh(ai?) + sinh(ai?) — e 2 (2a cosh(ax) + sinh(aa;)) 
= vr(4a 2 -l)(cosh(ai?)-l) ' (3 " 17) 

Expanding all trigonometric terms to their definition as sums of exponential functions and the fact 
that l/(cosh(a J R) - 1) = 2exp(-ai?)(l + Q(e- aR )) we obtain 



e 



-(a-l/2)(R-x) 



+ («-!) [e- 2aR - e {R-*)/i-<*{R+*)\ (1 + Q( e ~ aR )) (3.18) 

2ae ~ r/2 t - s (l-(l + ^Z±ll e - 2 A e -(«-i/2XR-x)\ h +Q ( e -«*)) . 
7r(a- 1/2) V V a + 1/2 J J K V >> 



The integral over the error term in (|3.16j) is at most 

/ 3(i?-i 

O e 2 



cos y(y. J J x 



Combining the above with Equation (|3.18p we finally have 
R 9 r (y)f(y)dy = 



2ae ~ r ' 2 l - ( 1 - ( 1 + a " V2 e- 2 ^ e -(«-V2)(*-A f 1 ± 0(e -r + 
vr(a- 1/2) V V a + 1/2 y J \ { 



e -r-(R-x)(a-3/2) 



(3.19) 
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For x = R — r we obtain 



2 / e r (y)f(y)dy = , ( 1 ± 0( e -W> + e 

Jr-t 7r(a - 1/2) V 



which results in 



fi(B r (R) n B (R)) = e~ ar (l + o(l)) + ^ ^ (l ± Ofc"^- 1 /^ + ef 
' 7r(a — 1/2) V v 

Note that the term e~ ar can be written as e - r / 2 ~( a - 1 / 2 ) r anc [ can be incorporated into the first 
error term, proving our claim. 

For fj,((B r (R) n Bq(R)) \ Bq(x)) and < x < R — r we obtain the same bound as in this regime 
/j,(Bq(R — r)j asymptotically dominates [i(Bo(x)). The solution for R — r < x < R is given by 
Equation (|51H . 

□ 

The next lemma confirms the somewhat intuitive fact that a ball around a point has a higher 
measure the closer the point is located to the center of the disk. 

Lemma 3.3. For all < tq < R, all tq < r < R and all < x < R 

fi(B (R) n B ro (R)) > ti{B (R) n B r {R)) 

and 

V(B (R) n B ro (R) \ B (x)) > fi(B (R) D B r (i2) \ B {x)). 

Proof. Because of Equation (|3.13p . f)3. 14|) and f)3. 15|) it suffices to show that O ro (y) > 9 r (y) for 
< y < R. To see this, recall first that 9 r (y) is given by the solution of 



Gr(y) = arccos 



cosh(r) cosh(y) — cosh(i?) 
sinh(r) sinh(y) 



The claim follows, since cosh W^ 3 ^ (v)^ ™ sh ( R ) [ s increasing in r and arccos is decreasing. □ 



4 Proofs of the Main Results 

Before we give the proofs for our theorems, let us briefly describe a technique which we use to show 
concentration for the clustering coefficient and the degree sequence. We wish to apply an Azuma- 
Hoeffding-type large deviation inequality (see Lemma 14.11 below) to show that the sum of the local 
clustering coefficients X := Y2vev c *> an< ^ ^ ne number of vertices of degree k are concentrated 
around its expectation. 

In a typical setting, such concentration inequalities require some kind of Lipschitz condition 
that is satisfied by the function under consideration. In our specific setting, the functions are X 
and Df-, and it is required to provide a bound for the maximum effect that any vertex has. However, 
the only a priori bound that can be guaranteed is that for example the number of vertices of degree 
k can change by at most n, as a vertex may connect or not connect to any other vertex. To make 
the situation worse, this bound is even tight, since a vertex can be placed at the center of disc, i.e., 
if it has radial coordinate equal to 0. 

We will overcome this obstacle as follows. Instead of counting the total number of vertices of 
degree k and the sum of all the local clustering coefficients, we will consider only vertices that lie 
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far away from the center of the disc, i.e., which have radial coordinate larger than f3R, for some 
appropriate /3 > 0. Moreover, we will consider only vertices such that all their neighbors have a 
large radial coordinate as well. This restriction will allow us to bound the maximum effect on the 
target function, as with high probability all these vertices do not have too large degree. 

More formally, we proceed as follows. We partition the vertex set of G a c(n) into two sets. The 
inner set I = I (($) contains all vertices of radius at most f3R while the outer set O = 0(/3) contains 
all vertices of radius larger than j3R. 

We will use the following large deviation inequality. Let / be a function on the random variables 
Xi, . . . ,X n that take values in some set A4. We say that / is Lipschitz with coefficients c\ . . . c n 
and bad event B if for all x, y € A 

\E[f\X u . . . Xi = x,B}- E[f\X u . . . X^Xi = y,B}\< q. 

(We denote by B the complement of B.) Then the following estimates are true. 

Theorem 4.1 (Theorem 7.1 in [I3j). Let f be a function of n independent random variables 
X\, . . . ,X n , each Xi taking values in a set A4, such that E[/] is bounded. Assume that 

m<f(X 1 ,...,X n )<M. 

Let B any event, and let C{ be the maximum effect of f assuming the complement B of B: 

max|E[/|Xi, ...,X i - 1 ,X i = x,B) - E[f\X 1: . . . , X<_i, X t = y,B}\ < a. 



x.y 



Then 
and 



Pr[/ > E[/] +t + (M-m) Pv[B]] < e' 21 ' c ? + Pr[B] 



Pr[/ < E[/] — t— (M — m) Pr[S]] < e _t /E » C « +Pr[B]. 

In our setting, the random variables of interest are usually functions of X±, . . . ,X n , where Xi 
denotes the coordinates of the ith vertex. For the clustering coefficient and the degree sequence a 
coordinate change can not have a large effect on the random variable as long as the degree of the 
corresponding vertex is small. The following lemma says that in 0(f3) the degrees of the vertices 
is bounded with high probability. 

Lemma 4.2. Let a > 1/2 and < f5 < 1. There is a constant c > such that the probability for 
the bad event 

B := lythere is a vertex in 0(/3) = Bq(R) \ Bo{f5R) with degree at least cn 1_/3 | 
is at most Pr[B] = e -^(™ 1_/3 ). 

Proof. Note that unless B holds every vertex in O := 0(/3) is connected to at most cn 1 ~ /3 other 
vertices in O. Therefore, any change in Xi can increase or decrease the number of vertices in O of 
degree k by at most cn l ~@ + 1 (the additional "+1" is due to the fact that vertex i could change 
its degree as well). It remains to bound the probability of B. By applying Lemma 13.31 we see that 
the expected degree in O of a vertex with radius r > f3R is at most 

n ■ n(B (R) n B m (R) \ B (f3R)) (Le = E3 O (n ■ e"^) = 0^). 



Hence the expected degree of a vertex of radius at least f3R is at most c'n 1 13 for some constant d . 
For c := 2ec' it suffices to apply a Chernoff bound to show that for a vertex v of radius at least (3R 

Pv[do(v) > cn 1 - 13 ] < 2- cnl ~ P . 
The statement of the lemma follows by union bound over all vertices. □ 
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4.1 The Clustering Coefficient 

Recall the definition of the local and global clustering coefficient in (|2.2p and ()2.3|) . We will need 
the following technical statement, which gives an estimate for the measure of the intersection of 
the balls around two coordinates (ri,9\) and (^,#2) if their angle difference 9 := \9i — #2! is very 
small. 

Lemma 4.3. Let /3 > 1/2, j3R < x < R, r x > r 2 > x and < 9 < e~ r2 / 2 - e~ ri / 2 . Then 

ix (B (R) n B ru0 (R) n B r2j6 (R) \ B (x)) = fi (B (R) n B ri (R) \ B (x)) . (4.1) 
Proof. It follows from similar observations like the ones that lead to (|3.4j) that 

rR rxmn{0 ri (y),e+er 2 (y)} 
/max{-0 ri (y),e-6r 2 (y)} 

For 9 < e" r2 / 2 - e" ri / 2 and n > r2, using Lemma |3. 11 it can be verified that 

max{-9 ri (y),9-9 r2 (y)} = -9 n (y) and min{9 n (y), 9 + 9 r2 (y)} = 9 n (y). 

□ □ 
Theorem \2.1[ Let f3 := 2/3 and for a graph G = (V,E) let 

|{{«i, « 2 } G -E I U!,u 2 G r(u)}| 



/.R r mm{e ri (y),e+e r2 (y)} 

v(B (R)nB n>0 (R)nB r2>e (R)\B (x))= / / f(y)d<t>dy. 



(deg(v)\ 
vev \ 2 / 

deg(i))>2 

and 

y ._ y- |{K«2}££ I U!,u 2 £T(y)nO(P)}\ 

Z-^ fdeg(v)\ 
»eo(fl v 2 / 

cZeg(u)>2 

Clearly, c = — and X > Y. It therefore suffices to derive a constant lower bound on E[Y] and 
to show that Y is concentrated around its expectation. Let EfY- | £] be the expected value of 
\{{ui,u2}ge \ ui,u2&r(v)nO(i3)}\ £ Qr a ver |- ex ^ w ith radius r conditioned on the event £ that the vertex 
\ 2 ) 

has degree at least 2. We observe that E[Y r \ 8] is exactly the probability that two randomly chosen 
neighbors u\ and U2 of a vertex v at radius r are connected. In order to derive this probability for a 
fixed vertex v at radius r, let us suppose that u\ is at coordinate (y, cp) G B (R) n B r (R) \ B (f3R). 
Note that by ([3.3p these coordinates satisfy < y < R and —9 r (y) < 4> < 9 T {y). Moreover, the 
probability for the event that u\ is at (y, 4>) is given by 

m 



fi(B (R)nB r (R))' 



The vertex 112 is connected to ui in such a way that {ui,U2} contributes to Y r only if U2 lies in 
the intersection of the balls B (R) n 5 r (i2) \ S (/3i2) and B (R) n \ B (PR). Therefore, the 

contribution of «2 to E[Y r | £], given the coordinates of 112, is 

y{B rfl {R) n By^R) n gp(g) \ gbW) 
n{B Q {R))nB r {R)) 
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Note that we choose U2 uniformly from all neighbors of v which makes it possible that u\ = ui- 
However, since the degree of v is at least 2 this event happens with probability at most 1/2. Putting 
all the above facts together implies that 

Wr 1 iJ -2j, R J_ 9r{y) (v(B (R)nB r (RW Hy - 

Since the term in the integral above does not depend on the angle, we can replace the integral from 
—8 r (y) to 9 r (y) by twice the integral from to 8 r (y). Further, we derive a lower bound on that term 
by integrating the radius only from r to R. Observe also that for r < y < R the upper boundary of 

the angle, r (y) ^ = 3 (1 + o(l))2e 2"^, is at least £ := e~2 — e~% and that therefore the term 
above is at least 

L R Jo f(v) ■ KB r ,o(R) n B vA R ) n B (R) \ B (f3R))dHy 
(fi(B (R)nB r (R)\B ((3R))) 2 

Applying Lemma 14.31 this integral simplifies to 

1 



(fi(B (R)nB T .(R)\B (l3R))y 



r r i 

f(y) ■ fi(B (R) n B y (R) \ B (f3R))d<f>dy 



o 



l_J, LJ^ er _ aiJ / (e _ r/2 _ e -y/2 )e y(a-l/2) d(j)dl 



12 

J_ f r/2-aR 
12 ' 



e y(a-l/2) 



# a - 1/2 p r , „i-R 



(a - 1 

> J_ ( p -(R-r)/2 , 1 p -a(fl-r) _ a ~ V 2 -(fl-rA 

" 24 V + 2(a-l) a-1 J' 

Let be a vertex at radius R. It follows from Lemma 13.31 that the degree distribution of every 
vertex in the graph dominates the degree distribution of v R . Therefore, for any v £ V 



Pr[£] = Pr[deg(v) > 2] > Pr[deg(v R ) > 2] > Px[deg{v R ) = 2] 



n 



n—3 



(»(B (R) n B R (R))f(l - fi(B (R) n B R (R))) 

n p a 2 2Qe ~ fl/2 ra r a 2 2"°~ c/2 

> r> p~ n <= 7r(a-l/2) — p — ° p 7r(a-l/2) 

" ne vr2(a-l/2)2 6 _e vr2( a -l/2)2 e 

By integrating E[Y^. | £] over all /3-R < r < R and multiplying with Pr[£], f(y) and n we get the 
expected value of Y 

2ae -C/2 

EM = » / sr / to) • Priel • EI5 I «]* > ^ - l/2)( a + i)( tt + 1/2) " ^ 
Thus, E[y] = G(n). 

Set / := Y,t := n 6//7 and 'bad' event B and c as stated in Lemma I4T21 Note that each coordinate 
change can influence / by at most q := cn}~^ + 1 as long as B holds. It therefore follows from 
Theorem gj] and Lemma 0~J that Pr [X < E[Y] - n 6 / 7 - Pr[B]) = o(l) and therefore that the 
clustering coefficient is with high probability at least E[Y]/n = 0(1). □ □ 
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4.2 Vertices of Small Degree 

In this section we prove the first part of Theorem 12.21 Before we give all technical details, let us 
describe briefly the main proof idea. Given the estimates in the previous sections, in particular 
Lemma 13.21 it is conceptually not very difficult to compute the expected number of vertices of 
degree k in G a c(n). Nevertheless, it is not clear how to show the claimed strong concentration 
bound. To this end, we will apply the same large deviation inequality as in the previous section. 

Recall our partitioning of the vertex set of G Qj c(n) into the inner set I = I(/3) and the outer 
set O = 0(/3). Moreover, let e(I,0) count the number of edges with one endpoint in / and the 
other in O. The next two lemmas show that e(I, O) and |/| are small. This indicates that most of 
the vertices of degree k, for not too large k, will lie in O. Let denote the number of vertices in 
O which have degree k in O, i.e., set 

D k (p) = \{v€Otf) | \N(v)nO(/3)\ = k}\. 

In Lemma 14.61 we derive the expectation of D/-({3) and finally, we combine everything to show that 
.Dfc is tightly concentrated around its expectation. 

Lemma 4.4. In G aj c(n), with probability at least 1 — e _nn(1> 

< max{n 1 / 2 ,4en 1 - 2a ( 1 -«e- aC ( 1 -' 3 )}. 

Proof. The number of vertices in / is distributed like Bin(n, fi(Bo(/3R))). Therefore the expected 
number of vertices in / is (by Lemma \3.2\i bounded by 

E[|/(/3)|] < nn(B (pR)) = (1 + o{l))ne- a{R -^ R) < 2n 1 -Mi-/3) e -<*c(i-,9) 
and it follows by the Chernoff bound that for t = maxjn 1 / 2 , 4en 1_2Q ( 1_ ^e~ Q!C '( 1 ~ /3 " ) } 

Pr[|/(/3)| >t] < 2 -* = e- nn(1) . 

□ 

Lemma 4.5. Let e > and < (3 < 1. Then it holds with high probability that 

e{I{fi),0{p)) = O (n 1 -^- 1 ^^ logn) . 

Proof. Let ro := (l — logn + C and note that by Lemma 13.21 the expected number of vertices 
of radius at most ro is at most 

nfi(B (r )) = (1 + o(l))ne- a ^- r °) = ne~^ +1 ^ logn = o(l). 

We therefore have with high probability no vertex of radius at most ro. For all ro < r < (3R we get 
by Lemma 13.21 that the expected degree in O of a vertex at radius r is at most 

nfi{B r (R) n B {R) \ B (f3R)) = O (ne" r/2 ) . (4.3) 

We now integrate over fj4.3|) in / to bound the expected number of edges between / and O. Recall 
that there are with high probability no vertices of radius at most ro and it therefore suffices to 
integrate from ro to f3R 

O (n 2 e- r l 2 p{r)dr^j = O (n 2 e~ aR >~V2> dr \ = q ( n 2-2«+2^-l/2)^ 

= O (V-(2«-D(l-/3)) . 

The lemma follows by Markov's inequality. 

□ 
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In the next lemma we derive the expected degree sequence of the subgraph spanned by O. 

Lemma 4.6. Let a > 1/2, C 6 R, and max {3/5, l/(2a)} < P < 1. Set £ := min{2(2/3 - l),l/2}. 
Then, for all < k = o(n s ) we /tawe i/iai 

E[ft<ffl] _ (!+„(!)) 5^ (r(* - 2 a ) - jy^e-'O*) , 

where £ = , ^ e" c / 2 . 

Proof. Let p = (r, 0) be an arbitrary fixed vertex with r > /3J2. Denote with g r the probability that 
a random vertex p' = (r', 0') has radius at least f3R and distance at most R from p. We have that 

q r = fi((B r (R) n B (R)) \ B (f3R)). 

The probability that p has k neighbors with radius larger than f3R then corresponds to the proba- 
bility that a binomial random variable Bin(n — 1, q r ) has value k. Therefore the expected value of 
-Dfc can be computed by 

nD k ] =nJ R R {^~ - q r ) n - l ~ k p{r)dr. (4.4) 

By Lemma 13.2^ Equation (|3.12p . and the observation that @(e~ 2al3R ) = 0(e~ r ) it follows that 



2ae- r / 2 
vr(a - 1/2) 



1 - e 



-(«-i/2)( 



1-/3)*) ^ ± G ( e -r + e -r-( Q -3/2)(l-^^ 



Recall that i? = 21ogn + C and therefore e r < e ^ R = o(l/n). To approximate the integral in 
Equation (|4.4p we separate the main and error terms as follows. Write 

gr = _2ae^/ 1 _ e _ {Q _ 1/2)(1 _ /3)fl x ^ /r = ( e -r-(a-3/2)(i-^ +o(1/n)> 
7r(a — 1/2) V / v 

Note that q r = q r (l db / r ). Before we proceed with the estimation of the expression in (|4.4p let 
us prove some auxiliary facts. For all r > f3R and all a > 1/2 we claim to have the following 
properties for q r , f r and k: 

i) Qrfr = o(l/n), ii) (q r ) 2 = o(l/n), and iii) / r fc = o(l). (4.5) 

Property i) is established by observing that whenever f3 > 3/5 

g r / r = O ( e -(a-3/2)(l-0Ji-r/a-r) ^J fi ) ( e -(a-3/2)(l-/3)ii-3/lR/2) = Q ^3-6(S-2a + 2a^ = Q (y n y 

The second claimed property is true because (q r ) 2 = 0(e~@ R ) = 0(n~ 2/3 ) = 0(n -6 / 5 ), while the 
third is satisfied by our choice of k = o{n 2<y2 ^~ 1 ^) . We claim that these facts imply 

n J R ^~ 1 ^{q r f(l-q r ) n - l - k p{r)dr = {l + o^ (4.6) 

Consider first the binomial coefficient. Since k <C v/n we have 



Y)=^n(i-^)=< i+ °< 
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Next we consider the term (q r ) k - By using the third property in (|4.5p we obtain that 

(q r ) k = ((6.(1 + f r )) k = (q r ) k {l + Q(f r k)) = (1 + o(l))(q r ) k . 
Finally, we have that 

(l-Sr)"" 1 -* = (l-q r ) n ~ l ~ k (l + q r fr/(l-qr)) n ~ 1 ~ k 
= e {n - l - k)Xo ^ l - qr \l + e{nq r f r )) 

e _„ 9r+ o((n-l-fe) g 2)+(fc+l)g r + ^j\ 

e -«*(i+ (i)). 

This completes the proof of (|4,6|) . We now estimate the integral on the right hand side of Equa- 
tion (|4.6p . To do so we first perform a variable transformation t = nq r . Then, the density p(r) can 
be expressed as 

asinh(ar) ^ a , ar _ _ ar , 

P[r) cosh(aR)-l 2(cosh(aR) -l) [e 6 ' 



a 



2(cosh(<xR) - 1) I I vr(a-l/2) 

Since l/(cosh(ai?) — 1) = 2n~ 2a e~ aC (l + o(l/n)), the above calculation yields that 

P(r) = ae~« c ( — gL_ ) 2Q (**)-*»(! + (1)) . 

Further we have that 

2 a fl - e -(a-l/2)(l-/3)m j 

dt = n 1 ; — >-(-l/2)e~ r,2 dr = — tdr =>- dr = -2t _1 di, 

7r(a — 1/2) 2 

and for the upper and lower bounds that 

7r(a — 1/2) 7r(a — 1/2) 

r 1= R — > h = n 1 ; — '-e~ R ' 2 = i ; — J -e- c/2 . 

vr(a-l/2) 7r(a-l/2) 

Putting everything together we have that the right hand side of (|4.6p can be approximated by 



^ 1 " fc! W a- 1/2 J I, 



fc! Vvr(a-l/2)/ v Jo 
The proof is completed with the observation ii = (1 — o(l))£. □ 
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Proof of the first part of Theorem ] 2. SI Set /3 := max{|, l/(2a), 1 — ^} + e, where e > will be 
chosen later such that /3 < 1. The total number of vertices of degree k, for any k in the considered 
range, is at most 

D h (P) + \I(J3)\+e(I(P),0(p)) (4.7) 

and at least 

D k (/3)-e(I(l3),0(/3)), (4.8) 

since every vertex in / could possible have degree k, and since each edge counted in e(I, O) may 
affect the degree of one vertex in O. We will argue in the sequel that the contribution of |/| and 
e(J, O) is with high probability negligible in the above equations. 

First, since j3 > 1 — 4-, by applying Lemma 14.41 we obtain that with probability at least 
1 < " !!; ". 

\I(P)\ = O (V-Mi-/^ . (4.9) 
Moreover, Lemma 14.51 yields that with high probability 

e{I{fi),0{p)) = Ofn 1 -^- 1 ^ 1 -^ log nj . (4.10) 

It remains to determine the value of D k . Our assumptions on k guarantee that k < n s ' , where 
^ (2a-i) (l-ff) ^ pj-Qyjjg^ that e > is sufficiently small. Additionally, we claim that 5' < 2(2/3 — 1). 
To see this, note that since a > 1/2 

2a + 1 10a + 3 

But h(a) is increasing, and thus it is maximized at a = oo, where linia-j.oo h{ot) = 3/5. This shows 
that indeed 5' < 2(2/3 — 1). With all these facts at hand we can apply Lemma 14.61 which implies 
that 

(g , (2o-l)(l-/3K 

E[D k (f3)} = e(nk-^ a+ ^) = n(n 1 -^ + ^') ( ^^MH)^^ . (4 . n) 

Note that by Lemma 14.21 the effect on D k (/3) if we change the coordinates of of vertex i is at most 
Cj := cn 1- ^ + 1 (the plus one is for the vertex itself) as long as B holds. We therefore apply 
Theorem 14.11 with / := -D&(/3), t := n 3 / 2 -P+ 2a£ anc j '^j' event B as stated in Lemma FOl Note 
that M = n and therefore M • Pr[B] = o(l). It follows that 

Pr[|^(/3) - E[DM]\ > * + o(l)] < 2e-^* 2 /n^^-«) + e -«^ = e -»"« 

However, /3 > 1 — y- + e implies for all < k < n 5 ' that 

E[Z>fc03)] = W („l-(2«-l)(l-«) = w ^3/2-/3+2^ = w(t)> 

This shows that with probability at least 1 — e _ ™ n(1) we have for every k in the considered range that 
D k ((3) = (l + o(l))E[jDfc(j0)]. Together with (|47fl) - (|4TTll the proof of the theorem is completed. □ 
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4.3 Vertices of Large Degree 

In the previous section we derived the degree sequence for vertices of degree k < re" 5 for some 
constant 5. For k > n s we observe that the radius of almost all vertices of degree k is concentrated 
around a specific r k . We effectively show that it suffices to bound the number of vertices of radius 
at most rfc to get a tight bound on the number of vertices of deg ree at least k. 

Proof of the second part of Theorem \2.2l Let L k denote the number of vertices of degree at least 
k. Set r k := 2(log(ra - 1) - log k + log ^^ 2 "^ )), q r := fi(B r (R) n B (R)) and observe that the 
expected degree of a vertex with radius r& is by Lemma 13.21 

(n-l)q rk = in-1)- (liO^ 1 /^ + e -*)) = k[l±0 

We set e := max •! k~( l ~ 2a ) 2 I a nd observe that for a > 1/2 and for all k < r^- 

l Vfc J ' — logn 

= o(e). 





Let count the vertices of degree at least k and radius at most r& — e, those of degree at 
least k and radius at least r k + e and those of degree at least k and radius in [r k — s, r& + e] . 
Using those conditions on the radius of the points we can write the expectation of L k as 

E[L k )=E[L<]+E[L>}+E[L±). (4.12) 

We now show that the first term in (|4.12p dominates the second and the third. 

Let us first inspect E[LjjT]. By Lemma 13.31 the degree distribution of a vertex at distance at 
least rfc + e is dominated by X ~ Bin(n — 1, n(B rk+£ (R) n Bq(R))). The expectation of such a r.v. 
is by Lemma 13.21 and the above observation 

ELY] = (n - 1) 2 " 6 ' (1 + o(e)) = fc(l - e /2(l + o(l))). 
7r(a — 1/2) 



By the Chernoff bound the probability that such a vertex has degree at least k is at most 

E[Jfl(£/2(l+o(l))) 2 „ 2 \ o/i \ 

Pr[X > (1 + e/2(l + o(l)))E[X]] < e a = e" Q(log n) = n - Q ( losn ) 



and therefore 

E[L>] < nn-^( log ") = o(l). (4.13) 

For points which have distance at most rj-ea similar argument holds as the degree distribution 
of all those points dominates Y ~ Bin(n — 1, fi(B rk _ £ (R) n Bq(R))). This random variable has by 
Lemma 13.21 expectation 

E[y] = (n - l) 2 " 6 ' (1 + o(e)) = k(l + e/2(l + o(l))). 
TT{a — 1/2) 

Hence the probability that such a vertex has degree smaller than k is at most 

E[Y](e/2(l+o(l))) 2 on 2 \ o/i -i 

Pr[y < (1 - e/2(l + o(l)))E[y]] < e" 2 = e" n(log n) = n- Q(logn) (4.14) 
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and therefore E[L5] = (1 — o(l))E[X< rfc _ e ]. Recall that X< rfc _ e denotes the number of points with 
radius at most r k — e and that its expectation is by Lemma [ 



nX<r k -e] = (1 + o(l))ne-< R -^-^ = (1 + o(l)) ( 7r(Q 2a i/2) ) 



2a 

„-aC„, -2a 



nk- za . (4.15) 



For the third term in (14. 12p it suffices to look only at points which have their radius in [r& — 
e, rj- + s] and neglect whether their degree is at least k. Clearly 

E[L±] < E[X< rfc+£ ] - E[X< rfe _J (E[X< rfe _ £ ]) = o(E[L<]) (4.16) 



and we can therefore conclude that 

tr(a - 1/2" 



E[L k ] = (1 + o(l))E[X< r ._J = (1 + o(l)) ( . 2 " ) e- aC nfe- 2( ' 



i 

Note that for k < j^pj, E[L&] = f2(log 2a ra) and therefore by the Chernoff bound 



|X< rfc _ e -E[X< rfc _ £ ]| > —J E[X< rfc _ e ] 

log log n 



< ofn -1 ' 



Pr 

which proves the statement of the theorem. □ 



4.4 The Average & Maximum Degree 

In the subsequent proofs for Theorem 12.31 and Theorem \2A\ we present two simple applications of 
Lemma 13.21 In particular, we show how it can be used to determine the average degree and the 
maximum degree in G a) c{n). 

Proof of Theorem \2.S\ The average degree of a fixed vertex at radius r is by definition 

k(r) = (n-lMB r (R)nB (R)) {Lem = m (n-1) ( j^y 2) (} ± 0(e~^> + e^))) . (4.17) 

The total average degree can be obtained by integrating k{r)p{r) over all < r < R. With the 
facts 2sinh(x) = e x (l — 0(e~ 2x )) and 2cosh(x) = e x (l + Q(e~ 2x )), valid for any x > 0, it follows 
that 

= asinh(ar) = ae a(r-R) . (1 ± Q ,-2ar + e ~a R)) (418) 
cosh(ai?) — 1 

Note that since a > 1/2 and < r < R, the error terms e~ 2ar + e~ aR in (|4.18|) are dominated by 
the error terms e~^ a ~ l l 2 ^ r + e~ r in (|4.17p . Hence, by abbreviating c a = 7r ^° 1 y 2 ) we obtain 



J\{r)k(r)dr = (1 + o(l))c a n j\ a( - r ~^ ■ e~ r l 2 (l±Q (V(«-V2)r + e -r^ dr _ 



k 



The last integral is elementary. In particular, recalling that R = 2 log n + C, we obtain that 

>-l/2)r 
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(1 + (1))J_ = (l+o(l)) n 



a-l/2L Jr=o v w 'a-l/2 v w/ a-l/2 
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The integrals over the error terms are of order 

O (n J R e^-^e-^e'^-^dA = O (ne- aR dr\ = o(l) 



and 



Finally, 



O (^ne- aR j\ r{a ~ 3/2) dAj = O (ne- aR e R ^/ 2 ^ = o(l). 



k = (l + o(l))^-— = (l + (1))- 



a- 1/2 v v 7; 7r(a - 1/2) 2 ' 
and the theorem follows. □ 



Proof of Theorem \2.4\ Let X< r denote the number of vertices with radius at most r, and define X> r 
similarly. Set ro := (2 — 1/ot) log n. We will consider vertices with radius r 6 [ro — (f>(n),ro + </>(n)] 
where </>(n) = o(loglogn) tends to infinity as n — > oo. Note that for sufficiently large n 

(tern, [alt 

E[X< ro _^ (n) ] = n • /x (B (r - 0(n))) < 2 e -«^)+ c ) = o(l). (4.19) 

It follows that with high probability there are no vertices with radius at most tq — 4>(n). Moreover, 
by applying (|3.9p 

a(<t>(n)-C) 2 a{<t>{n)-C) 

^ <M(£ (r + <Kn)))< • (4.20) 

Since the number of vertices with radius < ro + (f>(n) is binomially distributed, we infer that 

Pr[X< ro+0(n) = 0] < (1 - M (Bo(ro + #n)))) n < e -' aWn) - c) /2 = (i). (4 .21) 

That is, with high probability there is a vertex with radius at most ro + <j)(n). On the other hand, 
we will argue that the number of such vertices is very small. To see this, note that (|4.20p implies 
that 

nX< ro+Hn) ]<2e a ^- c \ 
and by applying Markov's inequality we infer, since 4>{n) = o(loglogn), that 

2e a(<j>(n)-C) 

Pr[X< ro+0(n) > logn] < — = o(l). 

In other words, the number of vertices with radius larger than ro + <f>(n) is at least n — logn with 
high probability. 

Note that the degree of a vertex at radius r is distributed like Bin(n - l,p,(B (R) n B r (R))). 
As there are with high probability no vertices of radius smaller than ro — <fi(n), by Lemma 13.31 the 
degree distribution of every vertex is with high probability dominated by ~Bm(n,p~ ), where by 
Lemma 13.21 

2q r Q—4>( n ) 

p~ := n(B (R) n B rQ _^ (n) {R)) = (1 + o(l))- 



vr(a - 1/2) 



1 <Hn) 



In this case the expected degree of a fixed vertex is at most np rQ < (1 + o(l)) n2a e 2 ' anc ^ 

by a Chernoff bound and a union bound there is with high probability no vertex of degree larger 



i </>(") 



than (l + (l))_ I ^ 7 _ y n^e^ 
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The lower bound can be established by the fact (shown in (|4.2ip ) that there is with high 
probability a vertex of radius at most ro + <j){n). There are with high probability at most logn 
such vertices which means that we can fix one and have that by Lemma 13.31 its degree distribution 
dominates Bin(n — logn,p+) where by Lemma f372l 

2ct T "n+0( Tl ) 

p+ := n(Bo(R) n B ro+Hn) (R) \ B (r + 0(n))) = (1 + o(l)) _ e ~ . 

Note that the expectation of Bin(n — log u,Pq) is (1 + o{l))npQ = (1 + o(l)) ^—^j^ w^e ~ and 

1 _ 4>(n) 

is therefore by a Chernoff bound with high probability not smaller than 7r ( Q °i / /2) n ^° e ~ ■ '— ' 
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